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Abstract 

Consider a random permutation ir £ <S n . In this paper, perhaps 
best classified as a contribution to discrete probability distribution 
theory, we study the first occurrence X = X n of a I-II-III-pattern, 
where "first" is interpreted in the lexicographic order induced by the 
3-subsets of [n] = {1,2, ... ,n}. Of course if the permutation is I- 
II-III-avoiding then the first I-II-III-pattern never occurs, and thus 
E(X) = oo for each n; to avoid this case, we also study the first 
occurrence of a I-II-III-pattern given a bijection / : Z + — > Z + . 



1 Introduction 

Consider a random permutation tt £ S n . In this short note, perhaps best 
classified as a contribution to discrete probability distribution theory (AMS 
Subject Classification 60C05), we study the first occurrence X = X n of a 
I-II-III-pattern, denned as follows: Order the 3-subsets of [n] = {1, 2, . . . , n} 
in the "obvious" lexicographic fashion 

{1,2,3} < {1, 2, 4} < {1,2,5} <...{l,2,n}<{l, 3,4} < {1,3,5} <... 

< {l,n-l,n} < {2,3,4} < . . . {n - 2, n - l,n}. 
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We say that the first I-II-III-pattern occurs at {a, b, c} if n(a) < 7r(6) < 7r(c) 
and if 7i(d) < 7r(e) < 7r(/) does not hold for any {d, e, /} < {a,b,c}. Of 
course if the permutation is I-II-III-avoiding, which occurs ([]]) with proba- 
bility ( 2 ^) f{n + 1)!, then X = oo and the first I-II-III-pattern never occurs. 
Consequently E(X) = oo for each n; to avoid this case, we also simulta- 
neously present results on the first occurrence of a I-II-III-pattern given a 
bijection / : Z+ -> Z+. 



2 Results 

In what follows, we use the notation X = a&c as short for the event {X = 
{a, b, c}}, and refer to the case of a bijection on Z + as the n = oo case. 

Proposition 1 For each n < oo, 

F(X = 12r) = — -. 

r — 1 r 

Proof. Let 7r(l),7r(2), . . .7r(r) be ordered increasingly as Xi < 22 < . . - x r . 
Then, if 7r(2) = x r _i and n(r) = x r , we clearly have X = 12r. Conversely if 
the second largest of {7r(l), vt(2), . . . vr(r)} is not in the 2 nd spot, then either 
7r(2) = x r , or 7r(2) < a; r _i. In the former case, the only way that we can 
have X = 12s is with s > r. If tt(2) < x r _i, there are two possibilities: If 
7r(l) < 7r(2), then X = 12s for some s < r, and if 7r(l) > tt(2) then X 7^ 12s 
for any s. Thus we must have 7r(2) = x r _i. Now for X to equal 12r, we must 
necessarily have ir(r) = x r , or else we would have an earlier occurrence of a 
I-II-III pattern. It now follows that X = 12r if and only if 7r(2) = x r _i and 
7r(r) = x r with the other values arbitrary, so that P(X = 12r) = *- r ~ 2 ^ ! = 
^-r — -, as desired. □ 

'i — 1 r> 

Proposition 2 If n = 00, then X = lsr for some 2 < s < r with probability 
one. 

Proof. Obvious. No matter what value f(l) assumes, there is an s with 
f(s) > /(l) and an r > s with /(r) > f(s). Let so, r be the smallest such 
indices; this yields X = lsor . □ 
Our ultimate goal is to try to determine the entire probability distribution 
of X; for n = 6, for example, we can check that the ensemble {P(X = abc) : 
1 < a < b < c < 6} is as follows: 
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Table 1 

The First Occurrence of a I-II-III Pattern when n = 6 



First I-II-III Pattern 


Probability 


Cumulative Pr 


123 


120/720 


0.1666 


124 


60/720 


0.2500 


125 


36/720 


0.3000 


126 


24/720 


0.3333 


134 


50/720 


0.4028 


135 


28/720 


0.4417 


136 


18/720 


0.4667 


145 


26/720 


0.5028 


146 


16/720 


0.5250 


156 


16/720 


0.5472 


234 


48/720 


0.6139 


235 


22/720 


0.6444 


236 


12/720 


0.6611 


245 


24/720 


0.6944 


246 


12/720 


0.7111 


256 


14/720 


0.7306 


345 


24/720 


0.7639 


346 


10/720 


0.7778 


356 


14/720 


0.7972 


456 


14/720 


0.8167 


oo, i.e. never 


132/720 


1.0000 



Recall that the median of any random variable X is any number m such that 
F(X < m) > 1/2 and F(X > m) > 1/2. Now Propositions 1 and 2 together 
reveal that for n = oo, 



oo 

P(X<134)>£_7-- = - 



r=3 

and 



1 11 
P(X>134) = l-£— j-- = - r 

r=3 

which shows that X has 134 as its unique median. For finite n, however, the 
median is larger - Table 1 reveals, for example, that m = 145 for n = 6. 
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Proposition 3 For n = oo, 



P(X = lsr) = 



[s — l)(r — l)r s — 1 \r — 1 r 



Proof. It can easily be proved, as in Proposition 1 and keeping in mind that 
n = oo, that X = lsr iff n(s) = x r _i, 7r(r) = x r ; and 7r(l) = max 1 <j< s _ 1 n(j). 
It now follows that 

( r -*)(s-2)\(r-s-l)\ 1 



r! (s — l)(r — l)r 

as claimed. □ 
Proposition 3 provides us with the entire distribution of X when n = oo; 
note that 

V V — (J— - - V — - 1 - i 

Z^Z^ s — 1 V 7~ — 1 r / ^ s - 1 s 

s=2 r=s+l v 7 s=2 

The probability of the first I-II-III pattern occurring at positions 12r is the 
same for all n < oo, noting, though, that for finite n, ^ s P(X = 12s) = \ — K 
There is, however, a subtle and fundamental difference in general between 
P(X = Irs), r > 3, when n = oo and when n is finite. We illustrate this fact 
for F(X = 13r) when n < oo. Recall from the proof of Proposition 3 that for 
X to equal 13r in the infinite case, we had to have 7r(3) = x r _i,7r(r) = x r , 
and 7r(2) < 7r(l). The above scenario wZ/ stz//, in the finite case, cause the 
first I-II-III pattern to occur at positions 13r, but there is another case to 
consider. If n — 7r(2) > 7t(1) then it is impossible for X to equal 12s for any 
s; in this case we must have 7r(3) = x r _2 and 7r(r) = The probability 

of this second scenario is 



3)! 1 



n(r — 1)! n(r — 2)(r — 1) 



Adding, we see that 



P(x = 13r) = if^ T _lV 1/ 1 



2 \ r — 1 r / n \ r- 2 
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and, in contrast to the n = oo case where the net contribution of F(X = 
13r; r > 4) was 1/6, we have 

T*^4 T" = -^ 

1 



6 n(n — 1) 
The above example illustrates a general fact: 
Theorem 4 For /mite n ; 

4^ ( s l 2 ) TTi) (s - k - 2)!(r - s - 1)! 
¥(X = lsr) = V k Ma-fc-iA J_y >_ 

1 ] n(n — l)...(n — k + l)(r - fc)! ' 

Proof. We may have k of the quantities 7r(2), 7r(3), . . . , 7r(s — 1) being greater 
than 7r(l), where fc ranges from to s — 2. In this case, these 7rs must equal, 
from left to right, (n, n — 1, . . . , n — k + 1). Arguing as before, we must have 
7r(s) = 2V-i-fc an d 7r(r) = :r r _fc. The rest of the proof is elementary. □ 
Unlike the infinite case, ^ s ^ r F(X = lsr) ^ 1. So how much is F(X > 
234), or alternatively, how close to unity is 

' kh.k "(» -!)...(» -*+D(r-t)l ' 

We obtain the answer in closed form as follows: Conditioning on 7r(l), we 
see that X > 234 iff for j = n, n — 1, . . . , 1, 7r(l) = j, and the integers 
— ... ,,7 + 1 appear from left to right in 7r. Summing the corresponding 
probabilities 1/n, l/n,l/(2!n), l/(3!n), etc yields 

Proposition 5 

F(X > 234) ~ -. 

n 

3 Open Problems 

• Lexicographic ordering is not our only option; in fact it is somewhat 
unnatural. Consider another possibility: What is 

inf{A; : there is a I — II — III pattern in (7r(l), . . . , 7r(/c))}? 
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This question is not too hard to answer from Stanley- Wilf theory. Since 



P((tt(1), . . . , n(k)) is I - II - III free) 




(k + iy: 



the probability that k is the first integer for which (7r(l), . . . , 7r(fc)) 
contains a I-II-III pattern is 



A more interesting question is the following: Conditional on the fact 
that first I-II-III pattern occurs only after the kth "spot" is revealed, 
what is the distribution of the first 3-subset, interpreted in the sense 
of this paper, that causes this to occur? For example, if we let n = 
6, and are told that the first k for which there is a I-II-II pattern 
in (7r(l), . . . , n(k)) is 5, what is the chance that the first set that 



causes this to happen is {1, 2, 5}, {1, 3, 5}, {1, 4, 5}, {2, 3, 5}, {2, 4, 5} or 



• Can the results of this paper be readily generalized to other patterns 
of length 3? To patterns of length 4? 

• Theorem 4 and Proposition 5 fall short of providing the exact proba- 
bilities P(X = rst) for r > 2. Can these admittedly small probabilities 
be computed exactly or to a high degree of precision? 

• Does the distribution of X consist, as it does for n = 6, of a series of 
decreasing segments with the initial probability of segment j + 1 no 
smaller than the final probability of segment p. 
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k\ 



(fc + l)!' 



{3,4,5}? 
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